A System for the Semantic Multimodal Analysis of News Audio-Visual Content

نویسندگان

Vasileios Mezaris

Spyros Gidaros

Georgios Th. Papadopoulos

Walter Kasper

Jörg Steffen

Roeland Ordelman

Marijn Huijbregts

Franciska de Jong

Yiannis Kompatsiaris

Michael G. Strintzis

چکیده

News related content is nowadays among the most popular types of content for users in everyday applications. Although the generation and distribution of news content has become commonplace, due to the availability of inexpensive media capturing devices and the development of media sharing services targeting both professional and user-generated news content, the automatic analysis and annotation that is required for supporting intelligent search and delivery of this content remains an open issue. In this paper, a complete architecture for knowledge-assisted multi-modal analysis of news-related multimedia content is presented, along with its constituent components. The proposed analysis architecture employs state-of-theart methods for the analysis of each individual modality (visual, audio, text) separately, and proposes a novel fusion technique based on the particular characteristics of news-related content for the combination of the individual modality analysis results. Experimental results on news broadcast video illustrate the usefulness of the proposed techniques in the automatic generation of semantic annotations.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Critical Visual Analysis of Gender Representation of ELT Materials from a Multimodal Perspective

This content analysis study, employing a multimodal perspective and critical visual analysis, set out to analyze gender representations in Top Notch series, one of the highly used ELT textbooks in Iran. For this purpose, six images were selected from these series and analyzed in terms of ‘representational’, ‘interactive’ and ‘compositional’ modes of meanings. The result indicated that there are...

متن کامل

A Comparison of Rule based and Distance Based Semantic Video Mining

In this paper, a subspace-based multimedia data mining framework is proposed for video semantic analysis, specifically video event/concept detection, by addressing two basic issues, i.e., semantic gap and rare event/concept detection. The proposed framework achieves full automation via multimodal content analysis and intelligent integration of distance-based and rule-based data mining technique...

متن کامل

People in videos from people in pictures

We propose an appearance based model for face recognition in news videos using an enormously large databank of still images. This is a step towards building an elaborate face-query system using multimodal audio-visual data. We use the fact that faces of the same person appear similar than of different people. We preprocess the videos, apply feature extraction, feature matching and a unique para...

متن کامل

Integrating multi-modal content analysis and hyperbolic visualization for large-scale news video retrieval and exploration

In this paper, we have developed a novel scheme to achieve more effective analysis, retrieval and exploration of large-scale news video collections by performing multi-modal video content analysis and synchronization. First, automatic keyword extraction is performed on news closed captions and audio channels to detect the most interesting news topics (i.e., keywords for news topic interpretatio...

متن کامل

Recent Advances in Video Content Analysis: From Visual Features to Semantic Video Segments

This paper addresses the problem of automatically partitioning a video into semantic segments using visual low-level features only. Semantic segments may be understood as building content blocks of a video with a clear sequential content structure. Examples are reports in a news program, episodes in a movie, scenes of a situation comedy or topic segments of a documentary. In some video genres l...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

EURASIP J. Adv. Sig. Proc.

دوره 2010 شماره

صفحات -

تاریخ انتشار 2010

A System for the Semantic Multimodal Analysis of News Audio-Visual Content

نویسندگان

چکیده

منابع مشابه

A Critical Visual Analysis of Gender Representation of ELT Materials from a Multimodal Perspective

A Comparison of Rule based and Distance Based Semantic Video Mining

People in videos from people in pictures

Integrating multi-modal content analysis and hyperbolic visualization for large-scale news video retrieval and exploration

Recent Advances in Video Content Analysis: From Visual Features to Semantic Video Segments

عنوان ژورنال:

اشتراک گذاری